Design Exploration of an Instruction-Based Shared Markov Table on CMPs

نویسندگان

  • Lixin Su
  • Karthik Ramachandran
چکیده

Our project starts from investigating if instruction-based sharing exists on commercial workloads such as Apache, Zeus, Jbb, and Oltp running on CMPs. We find that there is a large amount of instruction-based sharing on CMPs. Constructive interference at the instruction cache miss level also exists among different CMP cores. We further study if the implementation of a shared Markov table can help reduce L1 instruction cache misses for each CMP core. We find that a reasonably small shared Markov table, varying from 4K entries to 32K entries, can help reduce L1 instruction cache misses and can potentially evenly increase each CMP core’s performance and thus the overall CMP performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design Space Exploration of CMPs with Caches and Local Memories

Chip multiprocessors (CMPs) are the dominating architectures nowadays. There is a big variety of designs in current CMPs, with different number of cores and memory subsystems, because they are used in a wide spectrum of domains and so its best configuration highly depends on several design goals such as performance, energy consumption, scalability, area and programmability. This paper studies d...

متن کامل

Analytical Modeling of Partially Shared Caches in Embedded CMPs

In modern ubiquitous devices, optimizing shared lastlevel caches (LLCs) in embedded chip multi-processor systems (CMPs) is critical due to the increased contention for limited cache space from multiple cores. We propose cache partitioning with partial sharing (CaPPS) to reduce LLC contention and improve utilization. CaPPS can reduce the average LLC miss rate by 25% and 17% as compared to baseli...

متن کامل

Judicious Thread Migration When Accessing Distributed Shared Caches

Chip-multiprocessors (CMPs) have become the mainstream chip design in recent years; for scalability reasons, designs with high core counts tend towards tiled CMPs with physically distributed shared caches. This naturally leads to a Non-Uniform Cache Architecture (NUCA) design, where onchip access latencies depend on the physical distances between requesting cores and home cores where the data i...

متن کامل

The Impact of Collegial Instruction on Peers’ Pedagogical Knowledge (PK): An EFL Case Study

Shared responsibilities such as mentoring, instruction, learner monitoring and classroom management enable the peers to observe, review, reflect on and learn from the overall practical professional expertise of one another through collegial instruction experience. The present exploratory case study has The present exploratory case study has attempted to study collegial teaching as an innovative...

متن کامل

Micro-Profiler : A Fine-grained Application Profiler for ASIP Design

Current Application Specific Instruction set Processor (ASIP) design methodologies are mostly based on iterative architecture exploration that uses Architecture Description Languages (ADLs) and retargetable software development tools. However, for improved design efficiency, additional pre-architecture exploration tools are required to help narrow-down the huge design space and making coarsegra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003